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ADVANCES IN MECHANIZATION OF PATENT SEARCHING’ 


HISTORY 


A previous experiment in mechanized search- 
ing inthe U.S. Patent Office, using punched cards 
for the storage and retrieval of chemical com- 
positions of matter disclosures, has been de- 
scribed (1). That experiment yielded encour- 
aging results with respect to the feasibility of 
employing machine techniques for the complex 
types of searches required by patent examiners. 

Subsequent to that effort, aCommittee adviso- 
ry to the Secretary of Commerce was formed 
under the Chairmanship of Dr. Vannevar Bush. 
The purpose of the Committee was to study the 
Patent Office search procedures and problems 
as well as the prior experiment and to report on 
possible application of mechanized techniques of 
searching to speed up the handling of applica- 
tions by providing more rapid and efficient 
searches. 

In accordance with the Committee’s recom- 
mendations (2) the Patent Office, with the coop- 
eration of the National Bureau of Standards, is 
now engaged in an effort to put machine search- 
ing of compositions of matter on an operational 
basis. To stimulate and coordinate the program, 
an Advisory Committee to the Secretary of Com- 
merce has been formed. The Patent Office An- 
nual Report for the Fiscal Year 1955 refers to 
the general features and status of the previous 
and present programs (3). 

In view of the broad program now contem- 
plated, the mechanized search system must 
provide much greater flexibility in searching 
with respect to types of subject matter and vari- 
ation of search viewpoint than was available ac- 
cording to the earlier mechanization system. 
This article will discuss briefly some of the 
general features of the system which have been 
developed subsequent to the work described in 
the earlier article. 


SOME RELATIONSHIPS TO BE DEALT WITH 


In every patent search it is required that all 
disclosures be located which are of relevance 
to a particular point of view. The point of view 
varies with each search and it is desired, in 
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mechanization, to enable machine manipu!ation 
of the disclosure data so as to retrieve z:] in- 
formation pertinent to any of a number of con- 
stantly variable search viewpoints. 

To illustrate some of the considerations in- 
volved, a searcher may desire to find all docu- 
ments disclosing a mixture of compounds A and 
B used as a lubricating composition. Or he may 
be interested in discovering what uses have been 
disclosed for a mixture of A and B. In the first 
situation, the search requirement is satisfied 
by a disclosure of A mixed with B only where a 
lubricating function is also found associated with 
that mixture, whereas the second search re- 
quirement is satisfied by a disclosure of admix- 
tures of A and B, regardless of function. As 
another example, the disclosure of boric acid 
mixed with alcohol will not. satisfy a searcher 
whose interest is in compounds containing both 
boron and a hydroxyl group, for while the two 
named things are disclosed, their relationship 
is not in accordance with the search require- 
ment. 

It is often necessary to relate functions to 
pertinent materials. Assume, for example, that 
a mixture of compounds A and B forms an in- 
secticide, although neither of them has that prop- 
erty individually, and that upon the addition of 
compound C a lubricant is obtained, the insect- 
icidal characteristic not being present in the 3 
component mixture. The insecticidal function is 
associated with each of A and B but not with C, 
whereas the lubricant function is associated with 
each of A, BandC. Thisdisclosure may be rep- 
resented as follows: 


insecticide 
an 
A B Cc 
NN 
lubricant 


These properties may be considered as “dis- 
tributed to” or “shared by” each of the materials 
which contribute to the existence of the proper- 
ties so that the representation may be trans- 
cribed into the following form: 


A B 
D - insecticide D - insecticide 
D - lubricant D - lubricant 
© 


D - lubricant 


: 
: 
; 


where the “D” indicates a distributed or shared 
property. Since the three compounds posses 
these properties not individually, but only in cer- 
tain mixture combinations, a distinction must be 
made between the shared function of each com- 
pound which contributes to the function of the en- 
tire admixture and the function of the individual 
ingredient only. 

Some materials are disclosed only functional- 
ly. Take, for example, a mixture of nitrocellu- 
lose, a pigment and a solvent. The latter two 
terms represent particular materials, but they 
are described only in terms of their functional 
characteristics. Consequently, while materials 
disclosed in this manner can be retrieved in 
terms of function, they cannot be retrieved in 
terms of their chemical structure. 

A disclosure may show a relationship of al- 
ternativeness with or without equivalence. Thus 
the disclosure that “Compound X may be mixed 
with either water or gasoline” indicates that 
water is alternative to gasoline with respect to 
admixture with compound X. If when water is 
used, the composition becomes a mouthwash 
whereas with the use of gasoline an insecticide 
is formed, the relationship is one of mere al- 
ternativeness without equivalence. However, if 
the same utility is obtained with either water or 
gasoline mixed with compound X, a condition of 
equivalence exists between the water and gaso- 
line with respect to the utility involved, in addi- 
tion to their alternative relationship. 

There are many known materials of complex 
nature which are themselves admixtures of 
several chemical compounds. Many biological 
extracts, plant or animal parts or complex re- 
action mixtures, for example, contain a number 
of chemical compounds. It is necessary to re- 
cord each of these compounds and to associate 
the several compounds together interms of their 
being contained in a common mixture having a 
single source. An example of why this is so is 
that a disclosure of milk in admixture with 
sodium carbonate should be available to one in- 
terested in casein mixed with sodium carbonate 
as well as to one interested ina mixture of milk 
and sodium carbonate. 


Several compounds may each be described in 
terms of different relationships to a particular 
chemical process, some for example as being 
Starting materials, others as final products. In 
many disclosures the product of a chemical re- 
action iS not structurally described, and prod- 
ucts of this type may be best retrieved in terms 
of their relationship to the process of making 
them. For example, product L may be described 
solely as the product of the oxidation of com- 
pound M using compound V as the oxidizing agent. 


THE CLASS CONCEPT 


The concept of classes is fundamental to the 
search system. A search for an alcohol is 4 
search for any and all things having the charac- 
teristics which define the class of alcohols. A 
search for a bactericide is a search for any and 
all members of the class defined in terms oO 
that function. A class may, of course, be ex- 
pressed in terms of multiple characteristics, 
such as the class of bactericidal alcohols. 

Since patent searching requires the finding of 
similarities, all of the class relationships which 
can be discerned among the pertinent subject 
matter must be maintained. The machine lan- 
guage used to describe a disclosure should be 
such as to express the various Class relation- 
ships. That is, the codes for a material such aS 
aniline must bear sufficient relationship to the 
codes for benzene to permit retrieval of dis- 
closures of both of these materials in a search 
for benzene compounds and to permit rejection 
of one and selection of the other in a search for 
an anilino compound. 

The concept of class relationships and the ap- 
plication of the system thereto may be illustrated 
by Figure 1. An attempt has been made at uni- 
versality, i. e., without limitation to any specific 
type of class. Boolean symbology is employed. 

Let A, B and C represent classes defined in 
terms of characteristics A, B andC, respective- 
ly. When it is said that Class C is defined in 
terms of characteristic C it is not meant to say 
that each member of the class is C, but rather 
that each member of the class has, or may be 
describedin terms of the characteristics of “C— 
ness”. Class A includes as its members things 
which are members of Class A and no other class 
and members of Class A and some other clasS 
(X representing any other class). Assuming 
complete identification of all things, so that 
each thing is uniquely delineated from all other 
things, Class AMX would contain only one mem-— 
ber. If the characteristic defining Class Ais 
the benzene structure, the compound benzene 
would be the sole member of Class ANX. A 
member of Class ANX would be any compound 
containing the benzene configuration together 
with other groups attached thereto. Therefore, 
Class A, which is generic to ANX and AMX, 
would include as members benzene itself and 
all compounds containing the benzene structure 
somewhere in the molecule. The same princi- 
ple is similarly shown for two other distinct 
classes, B and C. 

Continuing this analysis, Class AMX includes 
things which are members of Class ANBNX, 
Class ANCNX, Class ANBNX and Class ANCNX- 
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It will be noted that the same member may be 
included within the scope of two different 
classes. Thus ANBNX is a member of ANX and 
also of BAX. If B is aclass defined in terms of 
the amino characteristic, aniline, which is a 
member of Class ANBNX is also a member of 
the benzene class, A, as well as a member of 
amino class, B. If we follow the diagram down, 
we can see that a compound which is a member 
of Class ANBNCNX is also amember of Clas -es 
A, B, C, AABAX, ANCNX and BNCNX. 

A search for a benzene compound isa generic 
search for all compounds containing the benzene 
nucleus and would include the unique compound 
benzene and compounds such as aniline which 
contain the benzene ring as a fragment thereof. 
A search request for anilino compounds excludes 
benzene but includes aniline and compounds con- 
taining the anilino group as a fragment thereof. 
A search request for aniline specifically includes 
aniline only (ANBNX). 

While these illustrations have been with re- 
spect to structural characteristics of organic 
compounds, the principles are similarly appli- 
cable to structural characteristics of inorganic 
compounds and to any type of characteristic 
such as a use, property, process condition, com- 
position, etc. Thus, if A is theclass of benzene 
compounds, B is a class of starting materials 
and C is a class of oxidation processes, the 
combination of these characteristics according 
to a designated relationship will specify a class 
containing all benzene compounds involved as 
Starting materials in oxidation processes. 


TYPES OF DISCLOSURE UNITS 


For convenience in machine manipulation the 
patent data have been analyzed into elementary 
bits of disclosure units which are characterized 
as belonging to one or the other of two major 
types of subject matter, the substantive type and 
the relationship type. The substantive type in- 
cludes, for example, the concept of a chemical 
element, whether in free form or in chemical 
combination, the particular manipulative steps 
and conditions of a physical or chemical process, 
the description of complex materials and ex- 
tracts, the functions and behavior of materials. 
The relationship type pertains to the various as- 
sociations among the substantive types, such as 
the way in which the elements are arranged in a 
structural formula portrayal of a chemical com- 
pound, the association of several materials asa 
physical admixture, the relationship of equiva- 
lence of several alternative intermediates for a 
chemical preparation or the fact that A is re- 
lated to B as intermediate to final product in a 
given process. i 


SCHEDULES 


For the purpose of assigning codes to the sub- 
stantive disclosures, it is necessary to draw up 
schedules of the structural and functional char- 
acteristics which are considered to be pertinent 
to patent searching. While many of the disad- 
vantages of conventional classification were 
avoided in the earlier mechanization method by 
the way in which these scheduled terms were 
applied, namely, by multiple coding and the 
elimination of the superiority concept, the 
limitations encountered by a generic and specific 
hierarchy of combinations of characteristics 
still existed. To construct and use a schedule 
of the magnitude which would be required to 
provide for all possible combinations of the 
more elementary terms would be a practical 
impossibility. Therefore, in the present mech- 
anization system a different method of schedule 
arrangement is being employed. By this method, 
all subdivisions of any particular generic or 
class term pertain to variations of the charac- 
teristics of that genus alone and do not involve 
characteristics of that genus combined with any 
characteristics pertaining to a different genus. 
In other words, indented under Class A will not 
appear any subordinate term which embodies 
characteristics of Class B. 

A schedule of elements may list them by 
atomic number, by atomic weight, alphabetically 
or in any other arrangement, and to each will 
be assigned a unique code. The periodic group 
will also be coded for each element. A schedule 
can be set up for the metallic characteristics of 
the element: 


Non-Metal 

Metal 
Light Metal 
Heavy Metal. 


A term such as “amine” will not have such terms 
as “with hydroxyl” or “with halogen” indented 
under it, but such terms as “primary” or “seC- 
ondary” can be indented under “amine.” 

In effect, each generic term will form a S€P- 
arate relatively short schedule and disclosure 
items will be coded from as many of such 
schedules as required. The relationships among 
the several codes which pertain to a single item 
will be shown by devices which will be described 
later in this article. Each schedule term wi 
have a generic meaning and by means of the 
modulating device, also described in more de- 
tail later in this article, its particular variatiO" 
of meaning will be indicated. The term “oxid4- 
tion,” for example, will always have the sam® 
general meaning, but it will be modulated to i9” 
dicate its relationship to an item which is 2" 
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oxidizing agent, a thing which becomes oxidized, 
and so forth. 

Use of the mechanization system by the 
searcher involves selecting characteristics per- 
tinent to each patent search and relating them 
into the classes which designate the point of 
view of the searcher so that selection by ma- 
chine can be made accordingly. This is com- 
plementary to the method of encoding disclo- 
sures, which involves coding the various factors 
from a plurality of viewpoints. Where, for ex- 
ample, a material is disclosed as having struc- 
tural characteristic A, such asthe benzene ring, 
and function F, such as solvent, both of these 
aspects of the material are coded, and one of 
the relationship devices to be described ties the 
two terms together as plural descriptors of a 
particular entity, that is, AF refers to a single 
material having the structure A and the function 
F. A search for A, regardless of function, would 
inyolve the recognition of A within the code AF 
and would result in retrieving the disclosure of 
material AF. A search for any material of un- 
designated structure which has the function F, 
would also retrieve the disclosure of AF. 


RELATIONSHIP DEVICES 


The relationships may be handled in various 
ways. Two of the devices employed at present 
are “grouping signals” and “interfixes.” If a 
disclosure mentions compounds A, B, C and D 
and states that the first two are mixed together 
and the last two are mixed together, the use of a 
grouping signal might illustrate the situation as 
(A, B) and (C, D). Here the parentheses would, 
by a pre-established definition, be known to en- 
close several things in an admixture relation- 
ship, materials in different sets of parentheses 
therefore being in different mixtures. To illus- 
trate the use of an interfix to describe the same 
situation we mighthave: AlB1C2D2. Here, al- 
so by a pre-established definition, those com- 
pounds having the same numerical attachment 
are in one mixture and those having different 
numerical attachments are indifferent mixtures. 
The absolute value of the numbers used is of no 
significance, but their identity or lack of identity 
is meaningful. 

Interfixes can also be used to identify a time 
sequence relationship. For example, a process 
may include a series of steps wherein reaction 
P is followed by reaction Q and this in turn is 
followed by reaction R. This could be illustrated 
as Pl Q2 R38. The time sequence may be dis- 
covered by applying the defining rule that the 
larger the interfix number, the later in time is 
the reaction associated with that number, or 
stated another way, any reaction is prior intime 
to one having a larger interfix number. If the 


process were N1 P1 Q2 R3, it wouldbe apparent 
that N and P are carried out simultaneously and 
that both reaction N and reaction P precede Q 
and R. This might be the case where N and P 
each results in separate products, these separate 
products are reacted together in Q, and the ma- 
terial resulting from Q is treated intheR reac- 
tion. 

Other factors in the reaction, such as temper- 
ature, pressure, proportions, manipulative op- 
erations and the role of a certain material as 
catalyst are also encoded and their relationships 
to the materials involved may be shown by the 
use of interfixes. 

Alternative relationships can be indicated by 
the use of interfixes. For example, a disclo- 
sure may be that compound A is present together 
with either compound B or compound C, but not 
with both B and C, i. e., (A) + [(B) or (C)] . The 
disclosure may then be coded: Al-2 B1 C2. A 
search for A mixed with B and a search for A 
mixed with C would each retrieve this disclosure 
since in each case the two materials sought have 
the same interfix. However, asearch fora mix- 
ture of B and C would not retrieve this disclo- 
sure since B and C have different interfixes. 

Grouping signals may be used to show certain 
features of the organization of the fragments of 
a disclosure. As will be shown in more detail, 
codes for the structures of compounds are based 
on codes for the elements involved. The codes 
for the structure of a compound and those for 
its functions can be associated by grouping sig- 
nals. Likewise the several compounds making up 
a mixture can be related to eachother by higher 
order grouping signals, and several composi- 
tions disclosed in a process can be grouped to- 
gether by yet higher order signals. For example, 


{Ar BFR | (Ce (DF 2] [(fFE \(GF, 3} 


would be the plan for the coding of a process in 
which two compounds having structural charac- 
teristics A and B, respectively, are reacted to- 
gether to form product C, followed by a reac- 
tion of C with D to form a mixture of products 
E andG. F represents the functionof each ma- 
terial indicated in the subscript of F. The nu- 
merals 1, 2 and 3 are interfixes to indicate the 
reaction sequences. The parentheses, square 
brackets and bowed brackets represent succes- 
sively higher orders of grouping signals. The 
parentheses group together the plural descrip- 
tors of an item, the square brackets group to- 
gether the items which are simultaneously 
present as a composition or admixture and the 
bowed brackets group together the several 
compositions relating to the same process. The 
symbols A, B, C, D, E, and G in this example 
represent all of the codes for the elements con- 
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tained in each of these compounds and their 
Structural arrangements. 


CODING CHEMICAL STRUCTURES 


Several different systems for coding struc- 
tures of compounds have been developed, all 
based on the same principles but differing in 
machine manipulation. An attempt has been 
made to keep the system as universal as pos- 
sible so that no limitations would be imposed 
as to the type of machine to be used. However, 
for purposes of illustration it is necessary to 
select a particular system, with the understand- 
ing that such system may be modified according 
to the nature of the machine ultimately used to 
make searches. The structural formula is taken 
as the basis for coding the structural charac- 
teristics of chemical compounds. The method 
of coding takes cognizance of each element 
present in the chemical compound and its 
graphic structural relationship to every other 
element so that any selected fragment of the 
entire compound can be recognized and retrieved 
by machine when a search is made for the class 
of compounds containing that fragment. In addi- 
tion, the complete configuration gives a code 
uniquely different from the code for any other 
compound. 

To illustrate coding using the interfix device, 
a ring system such as acridine (Fig. 2) 


Figure 2. 
may be coded as follows: 


(I) ré6/Clc2cccccic2cccr 


2 o_o = 
(il) r 6/C1C C3 C4NC2C1CC3C4Nr 


(lll) r 6/C3 CCCCC4C3CCCCr 


The underlines and brackets attached to codes 
I and II do not relate to the coding of acridine 
but will be referred to subsequently in the search 
descriptions. Each element will have a code 
identification but for ease in visual interpreta- 
tion the conventional symbols for the elements 


are used. Each ringis individually coded, start- 
ing at any point of the ring and going clockw!se, 
recording the elements in the Sequence in which 
they occur in the ring. “r” represents a COde 
designation for grouping signals indicating a ring 
and “6” indicates that there are 6 members in 
the ring. The numerals 1 thru 4 are interfixes 
which indicate a shared element, e. g., Cl is 
common to both ring I and ring II. 

For reasons that will become apparent the 
coding sequence is continued for more than a 
single encirclement of the ring, and the total 
number of elements coded is calculated as2 Nn - 
1, where n is the number of elements in the 
ring. 

A search request for a pyridine compound, 
generically, would be a search for a disclosure 
of the grouping of Figure 3 


ae 
| | 
Bee 


Figure 3. 


whether it appears as pyridine per se or asa 
fragment of a larger molecule such as acridine 
in Fig. 2. Starting at any point of the ring, for 
example the carbon marked with the asterisk, 
the requirement is to find the element sequence 
r6/--CCNCCC--r within the stored ma- 
terial containing the coded information. In this 
search the numerals after the C’s inthe acridine 
code are ignored, since they describe how rings 
are joined, which feature is outside the scope 
of the search request. It will be seen that the 
sequence sought appears in the code for ring II 
(bracketed portions a). If the searcher were to 
start his sequence at any other place in the 
ring, e.g., Starting at the nitrogen, the sequence 
would be r 6/- - NC C CC C--r which, it will 
be seen, also appears in the code for ring II of 
acridine (bracketed portions b). Because of the 
coding to 2 n- 1, the searcher needs no fixed 
point of reference for starting to specify the 
sequence desired, such sequence being picked 
up at any point in the 2 n - 1 length code. 
A search for a quinoline (Fig. 4), 


Sera 


generically, will be expressed as 
r6/-- Cn Gn CCGCC--r 


r 6/ - - CnC CCN Cn’ - - r, where“n’” is 
any number and “n’” is any different number. 
This code will be found within the codes for 
rings I and II of acridine at the underlined por- 
tions. The non-pertinent numerals 3 and 4 are 
ignored. The requirement is to find the desired 
pattern within the larger pattern, or, in effect, 
to see whether the requested code subtracts 
from the stored code. Note that the searcher 
will not know what interfix numbers were as- 
signed to a particular structure and he can only 
specify in his search request the sameness or 
difference of numbers associated with particular 
elements. 

As another example, take the straight chain 


H-O- CH, - CH, -O- CH, - CH,Br. 


In coding, the non-significant hydrogens are left 
out, as in the ring compounds of the previous 
examples, i.e., those not needed for identifica- 
tion of a structural group are not coded. The 
skeleton of the chain will then appear as follows: 


H-O-C-C-O-C-C-Br 


The code symbols for these elements will be set 
forth as follows in a sequence which will be 
handled by the machine in sucha way that it may 
be read from both left to right and right to left: 


Pi ea —_— 
QH-O-C-C-O-C-C-BrQ, where Q 
represents a code designation for grouping sig- 
nals indicating a chain. 

A search for an ether will be set forth as 
Q--C-O-C- -Q, which it will be seen is 
within the stored code in the bracketed portions. 

A search for an ether alcohol in which the two 
oxygens involved are in al - 4relationship, will 
be set forth as Q-- H-O-C-C-O-C--Q, 
which is found within the code in the portions 
underlined. 

For a combination of chain and ring, consider 


the compound (Figure 5) 


(ey ‘ T C-C-Br 
I II H 
10 C2 con 
a ee Ng 
Figure 5. 


the code for which is expressed as 


(I) r6/C1C2CC3CCC1C2CC3Cr 


(Il) r 6/C1C C4 C5 CC2C1CC4¥C5Cr 
QHO3Q 
QBrCC4Q 
QHNSHQ 


The structural groups OH, NH _ and CCBr are 
treated as straight chains. It is seen that two 
major types of groups, or what may be termed 
“building blocks,” are recognized, the chains 
and the rings. Chemical compounds may con- 
tain a chain joined to a ring and a ring joined to 
a ring. A chain joined to a chain is, of course, 
one chain. The joining ofchaintoring is by way 
of a bond juncture. The joining of ring to ring 
can be by bond juncture, as in diphenyl, or by 
element sharing, as in naphthalene. The joining 
relationship among these “building blocks” is 
indicated by an interfix number which follows 
the element of the block wherein this relation- 
ship exists. This particular kind of relationship 
is identified in the Fig. 5 example by the dots 
beneath numbers 3, 4 and 5 to indicate bond 
sharing, and the absence of the dots beneath num- 
bers 1 and 2 to indicate element sharing. 
A search for an anilino compound (Fig. 6) 


Figure 6. 
will be expressed as r 6/--Cn CCCCC--r 
QHNnHQ 


This pattern is found in the underlined portions 
of the Fig. 5 compound code. 

Similarly, a search may be made fora phenol, 
for a bromoethyl naphthalene, or for a naphthol, 
and so on, and the machine would select disclo- 
sures of the compound of Fig. 5 as well as dis- 
closures of all other compounds containing the 
structural group expressed in the search re- 
quest, and, therefore belonging to the particu- 
lar generic class desired. When the search 
question is so specific as to include only one 
member within the class, e. g., ethyl alcohol, 2, 
5 dinitro-aniline, Peruvian bark, the code sought 
will be one representing an “index number,’’ 
which is a unique code representing a specific 
compound, material or other particular thing. 
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The index number will be coded as an additional 
descriptor for the item it represents. Search- 
ing by index number will retrieve only disclo- 
sures of the specific concept and disclosures 
generic to the specific thing denoted will not be 
retrieved. 

It can be seen from the preceding examples 
that the elements in a ring or ina straight chain 
are coded in the sequence in which they occur 
in the structure of the ring or straight chain. 
Branched chain structures are treated in a some- 
what different manner because of several com- 
plex problems which they present. The methods 
of coding branched chains will be described ina 
subsequent article. 


MODULANTS 


One method mentioned above for indicating 
some of the contextual relationships of the ma- 
terials is by the assignment to each materia] 
of the function pertaining to it, and modulation 
of the function to indicate the role of the mate- 
rial with respect to that function. 

Modulants may be scheduled as follows: 


Potential (active) - 33 
Potential fulfilled (active) - 33 - 40 

Potential (passive) - 44 
Potential fulfilled (passive) - 44 - 40 

Resultant from - 55 


Considering the term “diazotization,’’ which 
is denoted by code 12, modulation of this func- 
tion would result in the following set of codes: 


Diazotization (generic concept) - 12 


Diazotizing agent - 12-33 
Diazotizes - 12-33-40 
Diazotizable - 12-44 
Is diazotized - 12-44-40 
Diazotized compound - 12-55 


Each term of function may be thus modified as 
desired by affixing to it the proper modulant 
and the resulting code is assigned to each ma- 
terial to which it pertains. 

Also, each material in a chemical reaction is 
assigned a code specifying the process type in 
which the material is involved, and the process 
or function code is modulated to indicate wheth- 
er the Compound is a starting material or prod- 
uct produced by the reaction. 


A disclosure that A+B diazotize C to make D 
may be coded as: 


(A-SM - 12-33-40) + (B- SM - 12-33 - 40) 
+ (C-SM- 12-44-40)» (D- FP - 12-55) 


where SM stands for starting material, FP for 
final product and A, B, C and D stand for the 
structural characteristics of compounds A, B, 
C andD, Note thattheverbal function “diazotiza- 
tion” is distributed to each of the compounds 
related to diazotization. 

A search for a diazotization process involving 
the production of D will be expressed as (D - 
FP - 12-55). These codes will be found to occur 
within the above disclosure. 

A search for “A” as a diazotizing agent will 
be: 

(A - 12 - 33), which it is noted will select the 
above disclosure, the potential action or adjec- 
tive form “Diazotizing agent,” being generic to 
the action or verbal form “Diazotizes,” since a 
disclosure that A diazotizes B indicates that A 
is a diazotizing agent. 

By the device of modulation a variety of mean- 
ings may be associated with any particular 
schedule term. By using the following set of 
modulants, for example, any term may be modi- 
fied according to its role in the context of the 
disclosure. 


Disease of - 50 

Disease by - 60 
Infection by - 60 - 10 
Toxicity by - 60 - 20 

Ingredient - 70 


If the term for a particular plant suchas a mold 
is associated with the code 60 - 10, the idea is 
expressed of a disease by infection with that 
mold. If the code 50 is associated with the code 
for the mold, it is thereby indicated that the mold 
is itself diseased. By association of code 70 with 
the mold, it is indicated to be present as an in- 
gredient in the composition. The use of code 
60 - 20 with a particular material expresses the 
throught of a disease of poisoning by the indi- 
cated material. 

The procedures described are still in tentative 
form and it is expected that many modifications 
may ensue as the work on mechanization pro- 
ceeds. Further details will be presented in 
future articles. 

Cooperative efforts and contributions made 
in connection with the described developments 
by D. D. Andrews, Director of the Office of Re- 
search and Development, and I. Marcus and 
H. Pfeffer, of the Chemical Staff of the same 
Office, are gratefully acknowledged. 
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